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1.  Introduction 

Stochastic  dominance  provides  a way  of  analyzing  risky  decisions  when  a 
decision  agent's  von  Neumann-Mor gens tern  (1947)  utility  function  u is  not 
fully  known  but  is  presumed  to  be  in  a class  U of  real-valued  functions  defined 
on  a consequence  space  X.  When  p and  q are  probability  measures  on  a suitable 
algebra  of  subsets  of  X,  we  sometimes  say  that  p stochastically  dominates  q 
with  respect  to  U if  Ju(x)dp(x)  > Ju(x)dq(x)  for  all  u 6 U for  which  the 
expected  utilities  are  finite. 

Although  stochastic  dominance  is  relevant  whenever  decision  alternatives 
are  described  by  probability  measures,  its  most  popular  use  involves  situations 
in  which  X is  a set  of  real  numbers  with  x preferred  to  y when  x > y.  This 
context  is  emphasized  in  the  introduction  to  stochastic  dominance  given  by 
Fishburn  and  Vickson  (1978)  and  will  be  the  context  used  for  the  present  paper. 
Definitions  of  various  degrees  of  stochastic  dominance  in  the  real  X setting 
can  be  given  without  explicit  mention  of  U classes,  and  I will  follow  this 
approach  here. 

My  purpose  is  to  establish  a general  connection  between  nth-degree 
stochastic  dominance  and  the  first  n moments  of  probability  measures  whose 
supports  are  bounded  below,  for  all  n € {1,2,...}.  It  will  be  assumed  that 
the  measures  are  countably  additive  and  that  their  supports  are  bounded  below 
by  0.  The  corresponding  distribution  functions  F,G,...  will  be  taken  to  be 
continuous  from  the  right  with  F(x)  ■ 0 for  all  x < 0 and  F(x)  + 1 as  x + ». 

The  basic  definitions  and  main  theorem  are  presented  and  discussed  in  the 
next  section  where  the  main  theorem  is  observed  to  follow  from  two  auxiliary 
theorems.  Proofs  of  the  latter  theorems  are  given  in  the  final  two  sections. 
These  proofs  are  based  solely  on  the  distribution  functions  and  will  not  Involve 
utility  classes. 
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2.  Definitions  and  Theorems 

Let  F be  the  set  of  all  right-continuous  distribution  functions  on  the 
real  line  with  F(x)  ■ 0 for  all  x < 0 for  each  F € F.  For  each  F € F let 
F1  ■ F and  recursively  define 

F1*"*"1  (x)  ■ jFn(y)dy  for  all  x >_  0 and  n € {1,2,...}. 
o 

Nonstrict  (>  ) and  strict  (>  ) nth-degree  stochastic  dominance  relations  are 
— n n 

then  defined  on  F as  follows: 

F > G iff  Fn(x)  < Gn(x)  for  all  x € [0,°°), 

— n — 

F > G iff  F / G and  F > G. 
n — n 

These  relations  are  transitive  and  increasingly  more  inclusive:  >1  cic-.. 

1 2 

and  > c >c. . . where  c denotes  proper  inclusion.  Utility  classes  that  are 

congruent  with  the  first  few  stochastic  dominance  orders  are  presented  in 

Fishburn  and  Vickson  (1978,  pp.  102-113).  For  example,  F >_  G iff  Ju(x)dF(x)  >_ 

Ju(x)dG(x)  for  all  nondecreasing  u on  [0,°°)  for  which  the  expectations  exist, 

and  F > G iff  JudF  > judG  for  all  nondecreasing  and  concave  u on  [0,<*>)  for 
— 2 — w 

which  the  expectations  exist.  The  corresponding  theorems  for  (>  ,>)  and  (>  ,>) 

l 2 

respectively  involve  strictly  increasing  u and  strictly  increasing-strictly 
concave  u.  Congruent  utility  classes  for  the  third-degree  relations  are  the 
subsets  of  the  second-degree  classes  in  which  first  derivatives  exist  and  are 
convex  (>  ) or  strictly  convex  (>  ).  Although  it  is  traditional  to  define 
F _>  G iff  F3(x)  £ G3(x)  for  all  x and  the  mean  of  F is  as  great  as  the  mean 
of  G (e.g.,  [Whitmore,  1970],  [Fishburn  and  Vickson,  1978]),  we  shall  see  that 
the  condition  on  the  means  is  redundant  in  the  present  formulation. 
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The  moment  sequence  for  F € F is  y(F)  - (y®  ,yi,y* . . . ) where  y°  ■ 

r F F F 

JdF(x)  - 1 and  y£  - JxndF(x)  for  each  n > 1.  The  part  of  this  sequence 

through  the  nth  moment  will  be  denoted  as  y (F)  • (y® y")  . Our  main 

n F F 

theorem  will  be  based  on  binary  relations  >*  on  (y(F):  F 6 F}  defined 

n 

lexicographically  on  the  basis  of  y as  follows: 

n 

y(F)  >*  y(G)  iff  y (F)  i y (G)  and  (-l)k-1yk  >(-l)k_1  y5 

n ii  u f (j 

lr  lr 

for  the  smallest  k for  which  y_  i y . 

r G 

For  example,  y(F)  >*  y(G)  iff  y*  > yi, , and  y(F)  >*  y(G)  iff  either  yi,  > yi, 

1 r u 2 F G 

or  (y£  - y^,  y|  < y£) . Alternatively,  y(F)  >*  y(G)  iff  either  F has  the 
greater  mean  or  the  means  of  F and  G are  equal  and  F has  a smaller  variance 
than  G. 

Theorem  1.  ^f  F,G  € F and  the  moments  of  F and  G through  order  n are 

finite,  then  y(F)  >*  y(G)  if  F > G. 

n — n 

For  discussion  purposes  the  next  three  paragraphs  assume  that  the  moments 
involved  therein  are  finite. 

The  n ■ 1 part  of  Theorem  1 says  that  if  F > G then  the  mean  of  F must 

l 

be  greater  than  the  mean  of  G.  Equivalently,  if  yi,  < yi,  then  F cannot 

F G 

strictly  first-degree  stochastically  dominate  G.  This  fact  seems  to  be  widely 
known. 

It  appears  to  be  less  commonly  recognized  that  F > G implies  that  either 

2 

y^  > y^  (in  which  case  the  inequality  on  second  moments  could  go  either  way)  or 
(Up  * y^,  y*  < y*).  Parts  of  this  result  have  been  noted  by  Hanoch  and  Levy 


(1969),  Rothschild  and  Stiglitz  (1970)  and  Fishburn  and  Vickson  (1978,  p.  78). 


■ 
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* 11 

For  n ■ 3,  Theorem  1 says  that  F > G entails  either  y_  > y*  or 

3 r Qj 

(y£  - y^,  y£  < y*)  or  (y£  - y3,  y*  - y*,  y3  > y£) . Although  parts  of 

this  have  been  noted  by  Whitmore  (1970)  and  Fishburn  and  Vickson  (1978, 

pp.  78-82)  for  special  cases,  the  complete  result  appears  to  be  previously 

unknown.  In  like  manner.  Theorem  1 for  n £ 4 appears  to  be  new. 

Since  F > G implies  yi,  > yi,  when  these  means  are  finite — and  even 
3 F “ G 

when  one  or  both  are  infinite — our  definition  of  > (or  > ) is  formally 

3 “3 

equivalent  to  the  traditional  definition.  It  should  be  noted  however  that 
upper-bounded  definitions  are  not  so  simply  related.  In  particular,  if  [0,b] 
with  b finite  includes  the  supports  of  F and  G then,  as  shown  in  Fishburn  (1976), 
it  is  possible  to  have  y^  < \1q  when  F3(x)  £ G3(x)  for  all  x € ( 0 , b ] . In  this 
case,  when  F3(x)  £ G3(x)  is  specified  only  over  an  interval  that  includes  the 
supports  of  F and  G,  it  is  necessary  to  include  the  stipulation  y^  £ y^  in  the 
definition  of  F > G if  we  want  > to  be  congruent  with  the  type  of  U class 

— 3 ~3 

mentioned  in  the  opening  paragraph  of  this  section. 

The  proof  of  Theorem  1 can  be  based  on  the  following  two  auxiliary 
theorems  that  will  be  proved  in  the  next  two  sections. 

Theorem  2.  If  F.G  6 F,  if  F > G for  some  m € {1,2....}.  and  if  the 
— — — m 

k k 

moments  of  F and  G through  order  n + 1 >_  1 are  finite  with  yf  * yQ  for 
k * 0,1,..., n,  then  (-l)n  y°+1  > (-l)n  y*)+1. 

r (j 

Theorem  3.  Ijf  F.G  6 F,  if  F >n  G,  and  if  the  moments  of  F and  G through 

lc  k Il““  1 II 

order  n >_  1 are  finite  with  yF  - yQ  for  k - 0,1, ... ,n  - 1,  then  (-1)  y^ 

> (-I)""1  y“. 

The  latter  theorem  implies  that  y (F)  ■ y (G)  cannot  be  true  when  F > G, 

n n n 

and  the  former  theorem  says  that  if  F stochastically  dominates  G in  the 
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nonstrict  sense  for  any  finite  degree  and  if  y (F)  * y (G)  then  y!J+1  > 

n n F — 

y^"1  if  n is  even  and  y£+1  <_  y^+I  if  n is  odd,  provided  that  the  moments 

involved  are  finite.  Theorem  1 follows  immediately  from  Theorems  2 and 

3:  if  F > G then  F > G,  hence  either  y(F)  >*  y(G)  or  y (F)  • y (G)  by 
n — n n n n 

Theorem  2;  since  Theorem  3 rules  out  yn(F)  * yn(G)  we  are  with  y(F)  >* 

y(G). 


3.  Proof  of  Theorem  2 

We  begin  with  a lemma  that  leads  to  the  proper  sense  of  the  inequality 
in  the  conclusion  of  Theorem  2. 

Lemma  1.  For  any  H € F with  finite  moments  through  order  n and  for  all 
m > n ^ 0 and  x > 0 let 

Tn  m(H,x)  - l (-l)k+1(“\yj/xk  + j (1  - y/x)mdH(y).  (1) 

’ k»o  Vy  y*o 

Then  T (H.x)  » 0 iff  H(0)  - 1 and.  for  all  other  H € F.  T (H.x)  > 0 if 
n,m  — " n,m  “ 

n is  odd,  and  T _(H,x)  < 0 if  n is  even. 

n,m  — 

Proof.  If  H assigns  probability  mass  1 to  y ■ 0 then  (1)  gives  T (H.x)  « 

n,m 

k 

-1+1*0.  Assume  henceforth  that  H(0)  < 1.  When  y„  in  the  right  hand  side 

X . oo 

of  (1)  is  replaced  by  Jy  dH(y)  + Jy  dH(y),  and  (1  - y/x)m  is  expanded 

0 x 

binomlally,  we  get 

T (H,x)  - /{  l (-l)k(“Vy/x)k}dH(y)  + J‘{  l (-l)k+1f”Vy/x)k}dH(y) 
n’m  0 k-n+l  Vk/  x k-o  'V 

x 00 

* J'a  (z)dH(y)  + Jb  (z)dH(y) 
o " n,m 


(2) 
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where  the  A and  B terms  are  defined  in  context  as  the  sums  under  the 
integrals  and  z - y/x  with  z € [0,1]  for  A and  z € [ 1 ,°°)  for  B.  We  consider 
A and  B in  turn. 

First,  A (0)  * 0 for  all  m > n > 0 and,  since  A (z)  » (1  - z)m  - 1, 
n,m  — o ,ra 

A (z)  < 0 for  all  z € (0,1].  Since 
o ,m 

A'  (z)  ■ dA  (z)/dz  - -mA  , (z)  for  m > n > 1 and  z > 0, 

n,m  n,m  n-i,m-i  — 

it  follows  that  A'  (z)  > 0 for  z > 0,  hence  by  continuity  and  A (0)  - 0 
l ,m  l ,m 

that  A (z)  > 0 for  all  z € (0,1]  and  m > 1.  Then  A'  (z)  < 0 for  z > 0 

l , m 2 ,m 

and  m > 2,  so  that  A (z)  < 0 for  all  z € (0,1].  The  obvious  continuation 

2 ,m 

of  this  process  gives 

A (z)  > 0 for  all  z € (0,1]  if  n is  odd, 
n,m 

A (z)  < 0 for  all  z € (0,1]  if  n is  even, 
n,m 

along  with  A (0)  * 0. 
n,m 

For  the  B part  we  note  first  (e.g.  [Feller,  1957,  p.  61])  that  B (1)  * 

n,m 

(-l)n+1(m  Mso  that  B (1)  > 0 if  n is  odd,  and  B (1)  < 0 if  n is  even. 

\ n J n,m  n,m 

In  addition, 

B'  (z)  - -m  B (z)  for  m > n > 1 and  z > 1. 

n,m  n- i , m- i — — 

Since  B (z)  = -1,  this  implies  that  B'  (z)  > 0 for  m > 1 and  z > 1,  hence 
o,m  r i ,m  — 

that  B (z)  > 0 for  all  z > 1 since  B (1)  > 0.  Then  B'  (z)  < 0,  hence 
i ,m  — l ,m  2 , m 

B (z)  < 0 for  all  z > 1 since  B (1)  < 0.  The  continuation  of  this  process 
2 ,m  — 2 ,m 


shows  that 
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B (z)  > 0 for  all  z € [1,®)  if  n is  odd, 
n,m 

B (z)  < 0 for  all  z € [1,®)  if  n is  even. 
n,m 

These  conclusions  along  with  those  for  A and  the  hypothesis  that  H has  positive 
probability  mass  on  x > 0 then  yield  the  final  conclusions  of  Lemma  1 in  view 
of  (2).  Q.E.D. 

For  Theorem  2,  suppose  that  F,G  € F have  finite  moments  through  order 

lc  1c 

n + 1 _>  1 and  that  y (F)  = y (G) , i.e.  y = y for  all  k < n.  Assume  also 

n n r o 

that  F > G for  some  m.  Since  > C > , for  all  m,  select  m > n with 
— m — m — nrt-i 

F > , G.  Then,  by  the  definition  of  > . and  Fishburn  (1976), 

UTt  1 TUT  1 

X X 

J(x  - y)mdF(y)  < /(x  - y)mdG(y)  for  all  x > 0. 
o o 

Since  y (F)  - y (G) , it  follows  that 
n n 


T (G,x)  > T (F,x)  for  all  x > 0. 
n,m  — n,m 


(3) 


Suppose  first  that  G(0)  » 1 so  that  T (G,x)  = 0 by  Lemma  1.  Then 

n y in 

0 > T (F,x)  for  all  x > 0.  It  follows  that  either  F(0)  = 1 and  F = G,  in 
— n,m 

which  case  y”+1  ■ y!?+1  ■ 0,  or  that  F(0)  < 1,  in  which  case  Lemma  1 implies 
r G 

that  0 > T (F,x)  for  x > 0 and  hence  that  n is  even.  But  with  n even, 
n,m 

(~l)n  ylT1'1  > (-l)n  y"+1  * 0 in  the  latter  case.  Hence  the  conclusion  of 
r G 

Theorem  2 holds  if  G(0)  ■ 1. 

Assume  henceforth  that  G(0)  < 1,  in  which  case  F(0)  < 1 also.  Then  (3) 
applies  with  neither  term  ever  equal  to  zero  according  to  the  last  part  of 
Lemma  1.  We  shall  prove  that 


lim 

x-*» 


n.m 

T 

n,m 


(F.x) 

(G,x) 


y 


y 


n+i 

F 

n+i 

G 


(4) 
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which,  in  view  of  (3)  and  the  final  conclusions  of  the  lemma,  implies  that 

(-l)ny!!+1>  (-l)1^^1,  as  desired  for  Theorem  2.  Since  lim  T (H,x)  =* 

F — u n,m 

-y^  + 1 “ 0,  we  differentiate  the  numerator  and  denominator  of  the  left 


hand  side  of  (4)  with  respect  to  x in  anticipation  of  using  l'Hospital's 
rule.  When  m/x2  is  canceled  from  the  numerator  and  denominator  after  the 


differentiations  have  been  completed,  we  get 

*1  (-l)k+1(V)uF+1/xk  + /(1  " y/x)m_1ydF(y) 

" ■ ■■  ■ . .1  ■ 11  ■ • 

T (-Dk+1(m^)yg+1/xk  + /(1  - y/x)m-1ydG  (y) 


d T (F,x)/dx 
n.m 

d T (G,x)/dx 
n.m 


(5) 


If  n » 0 then  the  numerator  and  denominator  of  (5)  respectively  approach 
yi,  and  y*  as  x -*•  00  by  the  monotone  convergence  theorem  (e.g.,  [Loeve,  1960, 
p.  124]),  and  a single  application  of  l'Hospital's  rule  shows  that  (4)  holds. 

If  n > 1,  then  the  numerator  and  denominator  of  the  right  hand  side  of  (5) 
both  go  to  zero  (-y*  + yi  and  -y*  + y*)  as  n ■+•  °°.  In  this  case  methods 
similar  to  those  used  for  Lemma  1 show  that  both  parts  of  the  right  hand  side 
of  (5)  have  constant  nonzero  sign.  We  then  differentiate  the  numerator  and 
denominator  of  the  right  hand  side  of  (5)  and  cancel  (m  - 1 )/x2  to  get 

7 + /a  - 

T <-l)k+1("“2)u^!/xk  + j(l  - y/x)"'!yJdC(y) 

If  n ■ 1 then  (4)  holds  by  two  applications  of  l'Hospital's  rule  since  the  limit 
of  the  expression  just  written  is  y2/y^.  If  n > 1,  we  continue  in  the  indicated 
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A 

manner  until  we  arrive  at  the  ratio  J (1  - y/x)n,"n“1yn+1dF(y)  /J'(l  - y/x)m_n'1yn+1 
dG(y),  whose  limit  is  Up+1/y”+1.  Hence  n + 1 applications  of  1’ Hospital's 
rule  yield  a general  verification  of  (4) . 

4.  Proof  of  Theorem  3 

Throughout  this  section  we  use  the  convention  that  t°  = 1 when  t * 0.  In 

anticipation  of  using  the  monotone  convergence  theorem  later  we  shall  first 
prove 


Lemma  2.  Suppose  H € F has  finite  moments  through  order  n - 1 with  n > 1, 
and  0 £ y £ x with  x > 0.  Then 

(-l)n[nlHn(y)  + J (-l)n*k+‘(")  kp"'kyk'‘ ] 
k«i  H 

> + l <-l)”‘k+,(")  kyk-  /z"-kdBU)] 

k»i  z”0 

> 0. 


Proof.  We  note  first  that 
n 

,n-k+i/n\  . n-k  k-i 


l (-Dn"K+1(")  ky-V1  - n T(-Dn-k(n->kU 


,n-k/n-i\„k  n-k-i 
JH 


n-k+i/n\  . k-l  ? n-k 


l (-Dn"K+I(J)  kyfc-1  f zQ-k  dH(z)  - n Y (-1) n~k(n“ 1 ) yk  / 
-l  z*0  k-o  \k/  2,0 


dH(z) 


n!Hn(y)  - n J (y  - zJ^dHCz) 
z*o 


-n  J‘  (y  - z)n~dH(z) , 
z=»o 

([Fishburn,  1976]). 


By  the  first  two  of  these,  the  first  inequality  in  the  conclusion  of  Lemma  2 
holds  if  and  only  if 


which  is  true  if  and  only  if 


T (-1  > o. 

k-o  v K 7 2=v  ~ 


But  this  is  true  since  its  left  hand  side  equals  J‘  (z  - y)n~ldH(z).  By 

z=*x 

the  final  two  expressions  in  the  opening  sentence  of  this  proof,  the  second 
(>  0)  inequality  in  Lemma  2 holds  if 


(-l)n[-n  J (y  - z)n_1dH(z)]  > 0, 


which  is  true  since  its  left  hand  side  equals  n J (z  - y)n-1dH(z).  Q.E.D. 

z-y 

When  dy  is  replaced  by  e^d(l  - e in  the  following  expressions,  with 

1 - e a distribution  function  in  F,  Lemma  2 in  conjunction  with  the  monotone 

convergence  theorem  shows  that  if  is  finite  then 

H 


<-1)"  / [n!Hn(y)  + j?  <-l)n-fcfl(?)ku?V',]<ly 

y-o  k=»i  Vk/  H 


U»  U-l)”  J'  (n!Hn(y)  + J / z"-kdH(z) ]dy> . 

v-*on  i V K/ 


The  expression  within  braces  after  lim  reduces  as  follows: 


(-lAnm^’W  + l (-1) -"‘m/  1 zn-kdH(z)  ] 
k* l z*  o 


,n-k+i/n\  k c n-kj 


(-!»”[  / <x  - y ) ndH (y ) + f (-l)"-k+,(")zk  /y"-kdH(y) 
ym0  k* l K/  y*o 


11 


(-1) 


”[7  l <-l)"-k(”)  xVk<iH(y)  + I <-l)"-k+'(")xk/y"-kdH(,)l 

Ly«  k-,  v y-«  J 


J yndH(y). 
y-0 


Therefore,  when  y**  1 is  finite, 
H 


; !«!."(,)  + I kU"-k,k-']dy  - (-l)V 

jy=0  k-1  V1C/  ” H 


(6) 


Finally,  as  in  the  hypotheses  of  Theorem  3,  suppose  that  F > G and 

n 

that  the  moments  of  F and  G through  order  n are  finite  with  yk  =•  yk  for  all 

F G 

^ 1 n * 1*  Then,  since  F (y)  G (y)  for  all  y,  and  Fn(y)  < Gn(y)  in  a non- 
degenerate  interval  within  [0,°»),  (6)  shows  immediately  that  (-l)nyn  < (-l)n  y11. 
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